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Abstract 

Background: In many countries, substitution of physicians by nurses has become common due to the shortage of 
physicians and the need for high-quality, affordable care, especially for chronic and multi-morbid patients. We 
examined the evidence on the clinical effectiveness and care costs of physician-nurse substitution in primary care. 

Methods: We systematically searched OVID Medline and Embase, The Cochrane Library and CINAHL, up to August 
2012; selected and critically appraised published randomised controlled trials (RCTs) that compared nurse-led care 
with care by primary care physicians on patient satisfaction. Quality of Life (QoL), hospital admission, mortality and 
costs of healthcare. We assessed the individual study risk of bias, calculated the study-specific and pooled relative 
risks (RR) or standardised mean differences (SMD); and performed fixed-effects meta-analyses. 

Results: 24 RCTs (38,974 participants) and 2 economic studies met the inclusion criteria. Pooled analyses showed 
higher overall scores of patient satisfaction with nurse-led care (SMD 0.18, 95% CI 0.13 to 0.23), in RCTs of single 
contact or urgent care, short (less than 6 months) follow-up episodes and in small trials (N < 200). Nurse-led care 
was effective at reducing the overall risk of hospital admission (RR 0.76, 95% CI 0.64 to 0.91), mortality (RR 0.89, 95% 
CI 0.84 to 0.96), in RCTs of on-going or non-urgent care, longer (at least 12 months) follow-up episodes and in 
larger (N > 200) RCTs. Higher quality RCTs (with better allocation concealment and less attrition) showed higher 
rates of hospital admissions and mortality with nurse-led care albeit less or not significant. The results seemed more 
consistent across nurse practitioners than with registered or licensed nurses. The effects of nurse-led care on QoL 
and costs were difficult to interpret due to heterogeneous outcome reporting, valuation of resources and the small 
number of studies. 

Conclusions: The available evidence continues to be limited by the quality of the research considered. Nurse-led 
care seems to have a positive effect on patient satisfaction, hospital admission and mortality. This important finding 
should be confirmed and the determinants of this effect should be assessed in further, larger and more methodically 
rigorous research. 
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Background 

Concerns about the global shortage of health care pro- 
viders [1,2] continue to fuel the debate about the need to 
introduce new strategies of health care delivery. Especially, 
the increasing shortage of physicians makes substitution 
by nurses a common demand which is expected to escal- 
ate with ageing populations and an increasing prevalence 
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of chronic conditions. Two systematic reviews published 
ten years ago suggested that care provided by nurses 
might be equally good as the care provided by physicians 
[3,4]. Health outcomes, use of resources and healthcare 
costs were found to be similar between nurses and physi- 
cians while patient satisfaction was similar or better with 
nurse-led care. These differences, however, were limited 
by the low volume and quality of the studies. In this con- 
text, it is also important to consider that nurses' education 
continues to evolve resulting in different roles and qualifi- 
cations across different health care systems. It seems 
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timely therefore to assess whether the updated evidence 
would support the notion that nurses can substitute physi- 
cians in specific clinical tasks. Therefore, we performed a 
systematic review and meta-analysis of trials investigating 
the clinical effectiveness and costs of nurses working as 
substitutes for physicians in primary care. 

Methods 

We followed a protocol developed prior to starting 
the review and followed the PRISMA guidelines [5] 
for the reporting of systematic reviews and meta-analyses 
(Additional file 1: Table SI). 

Study inclusion/exclusion criteria 

We included peer reviewed randomised controlled trials 
(RCTs) from any country published in English in which 
nurses (in any type of role) substituted physicians by 
acting as the main figure of care with autonomous or 
delegated clinical responsibility for tasks that would have 
formerly been performed by physicians alone: where 
nurse-led care was compared to physician-led care (family 
physicians, paediatricians, and geriatricians); the interven- 
tion had taken place in general practices, community or 
ambulatory care settings; in patients of all ages seeking 
care for all conditions including mental health and ad- 
diction restricted to primary care; and which reported 
on patient satisfaction, quality of life (QoL), hospital 
admission, mortality and cost of health services. Fol- 
lowing the framework published in a Cochrane review 
[3], we excluded studies in which nurses firstly, pro- 
vided services which supplemented or extended the 
care provided by physicians or tasks that are not part of 
the usual care of physicians and secondly, where nurses 
collaborated with other clinicians in a team and thus 
the effect of nurse-led care, as the main intervention, 
could not be distinguished. 

Study identification 

We searched OVID Medline, Embase, CINAHL and The 
Cochrane Library of Systematic Reviews which includes 
the Cochrane Effective Practice and Organisation of 
Care Group, from all available dates until August 2012. 
The searches, not age-, date- or country-specific in- 
cluded 'primary care' , 'skill-mix^ 'physicians'-'nurse' sub- 
stitution' (Additional file 1: Table S2). We also manually 
searched the reference lists of included studies and rele- 
vant reviews. 

Assessment of study quality 

We assessed the risk of bias of all trials without the calcu- 
lation of a composite score following available guidelines 
[6-8]. We considered bias due to attrition of more than 
20% to be of significant concern; and adequate intention- 
to-treat (ITT) if trial authors analysed participants based 



on their original group allocation regardless of protocol vi- 
olations or non-compliance [9]. 

Data extraction 

Both qualitative (characteristics of studies, population and 
interventions) and numeric data (dichotomous and con- 
tinuous format) were extracted using structured data col- 
lection forms, designed and pilot-tested a-priori. If more 
than one comparison group of interest were reported, 
these were combined and compared as one to nurse-led 
care. If the results from a single study were reported in 
more than one publication, data were extracted as one 
study. When one publication reported more than one co- 
hort, data were extracted as separate studies. 

Selection and assessment of studies and acquisition of data 

Two authors independently screened titles and abstracts, 
assessed both the full-text of eligible publications and 
the risk of bias of included studies, and extracted data. 
Differences were resolved through consensus. 

Statistical analyses 

We calculated the individual and pooled unadjusted 
relative risks (RR) and the standardised mean differences 
(SMD); and performed meta-analyses when at least three 
trials reported appropriate data, using the inverse vari- 
ance fixed-effects (FE) method and repeated the analyses 
using a random-effects (RE) model in Cochrane RevMan 
(Version 5.1) [10]. We report the summary statistics, 
their 95% confidence intervals (CI) and consider p < 0.05 
statistically significant. When scales pointed in opposite 
directions, we subtracted the mean from the maximum 
possible value of the scale and estimated the standard de- 
viations (SD) using well-established techniques [11]. We 
analysed dichotomous and continuous data together by 
converting ORs to an effect size expressed in SMD using 
available methods [12]. We decided to use a FE model in 
keeping with: 1) having no basis to assume that the effects 
had a normal distribution, 2) the small number of studies 
in at least two of the analyses, 3) the accuracy in estimates 
and CIs that FE provides even in a small number of stud- 
ies and the more weight assigned to larger studies; RE 
gives similar weight to small and larger studies. We quan- 
tified heterogeneity using the I^ statistic [13] and explored 
the effects of nurse-led care and potential sources of 
1? by pooling data into pre-specified subgroup analyses 
by clinical characteristics: nurses' roles (based on re- 
ported details: nurse practitioner with higher degree 
courses/specialisation (NP-i-) versus nurse practitioner 
(NP) versus registered/licensed nurse (RN/LN), type of 
care (single contact versus on-going care; urgent versus 
non-urgent) and length of follow-up (months: <6 ver- 
sus >6; <12 versus >12). We explored the effect of po- 
tential sources of bias by study size (small, N < 200 versus 
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large, N > 200), allocation concealment (adequate versus 
inadequate/unclear) and attrition (<20% versus >20%), 
and inspected publication bias using funnel plots 
where there were at least 10 trials [14]. We performed 
sensitivity analyses by excluding trials with potentially 
contaminated samples (i.e. patient crossover between 
groups), quasi and cluster design and in which nurses 
had full clinical autonomy (to perform tasks) and/or 
where this information was not reported. For data not 
combined in meta-analyses, individual trial estimates 
were compared. 

Results 

Study identification 

A total of 4,133 original records were identified. We ex- 
cluded 12 of 44 relevant publications for the reasons 
provided in Table S3 (Additional file 1). Twenty-six 
studies reported in 32 publications met the inclusion cri- 
teria and comprised a total of 38,974 randomised partici- 
pants (Figure 1) [15-46]. Twenty-four of the studies 
were RCTs and the other two were economic evalua- 
tions based on three of the appraised RCTs [17,26,38]. 
Table 1 and Table S4 (Additional file 1) report the sum- 
mary and detailed characteristics of participants, inter- 
ventions and outcomes of the trials included in review. 

Study and population characteristics 

There were twenty RCTs of parallel design, three 
cluster-RCTs, one quasi-RCT and two studies [31,38] 
with cost data from three of the included RCTs 
[17,26,39]. The trials were conducted in the UK (n = 9), 
the Netherlands (n = 6), the USA (n = 6), Russia (n = 1) 
and South Africa (n = 2). Median follow-up was 14.8 
(range: 0.5 to 122.4) months with at least 12 months in 
fourteen trials, less than 6 months in seven and 6 to 12 
months in the other three. The median number of partici- 
pants was 1,624 (range: 50 to 12,894) with less than 200 in 
eleven trials and more than 200 in the other thirteen. Mean 
age was reported in twenty trials and ranged from 10 to 83 
years. Twenty-two trials reported on gender and 38.3% of 
the participants were male; one included women only. 

Settings and interventions 

A summary of settings, interventions and nurses' roles are 
reported in Table 1 and Table S5 (Additional file 1). Nurses 
worked as physician substitutes in a range of care settings. 
The interventions were carried-out in general practices 
[17,18,26,29,30,33,39,44], nurse clinics [23,36,40,42] and in 
hospital-based, health care centres, specialised practices, 
community or university clinics [15,16,22,25,27,28,32, 
34,35,37,46]. In the controlled intervention, nurses were the 
main figure of care with autonomous or delegated responsi- 
bility in various clinical domains including a whole range of 
possible (undifferentiated/minor acute/common) or specific 



conditions (e.g. hypertension, heart failure, diabetes, HIV, 
etc.). In one trial, the clinical domain was assumed to repre- 
sent undifferentiated care [35]. Nurses' specific qualifica- 
tions and training were not reported in sufficient detail but 
using the information provided by study authors we 
grouped nurses' roles. Nurses' roles were described in 
some detail in sixteen trials (two reported in one publica- 
tion) [15-18,22,23,27-29,32,36,37,39,42,46]. Seven trials 
employed NP+ only [16,25,28,32,39,40,42], six employed 
NPs [18,22,26,30,34,44], eight employed RN and/or LN 
[15,23,27,29,33,35-37], one employed NP and NP+ [17], 
and one employed NP and LN [46]. Nurses' interventions 
were guideline- or protocol-based in eighteen trials, while 
six had no report of having followed specific guidelines. 
Nurses' clinical autonomy was obtainable from twenty-two 
trials. In three, nurses had full clinical autonomy to manage 
patients with diabetes type II [30] or undifferentiated condi- 
tions [34,35]. In the other nineteen, nurses made independ- 
ent decisions to perform several tasks (e.g. adopting, 
initiating and prescribing treatment, ordering tests or refer- 
rals) but they still required minor support or contact with 
the physicians (e.g. to sign prescriptions, referrals and tests, 
to discuss patients' records or to develop action plans). Al- 
though the interventions in the control group were not 
clearly described in at least a few trials, these were assumed 
to represent physicians-usual-led care. Ten trials addressed 
single contact care [15,39], single contact and on-going care 
[27,35,37], single contact and urgent care [17,18,26,34] and 
single, on-going and urgent care [22]. The other fourteen 
included patients in on-going care for complex conditions 
(e.g. HIV, Asthma, hypertension, heart failure, etc.). 

Risk of bias in the methods of the included studies 

The overall quality of the studies varied substantially 
when assessed against current reporting standards [6,11] 
(Table 2). Only 54.3% of the trials measured the success 
of the intervention by defining a primary outcome. Ran- 
dom sequence generation was adequate in 54.0%, alloca- 
tion concealment in 42.0%, blinding of patients and 
providers in 4% and blinding of outcome assessors in 
21.0%. Patient or clinician crossover between groups was 
reported in 12.5% of the trials. At baseline, groups were 
comparable for all tested factors in 70.8% of the trials. 
Both inclusion and exclusion criteria were reported in 
70.8%. Sample size calculation based on power (80.0% to 
90.0%) was performed in 70.8%, but only ten trials held 
the least target sample size to achieve power in at least 
one outcome. Rates of missing data varied widely (range: 
5.0% to 65.5%). While three did not report any attrition, 
more than half (13/24) of the trials had an attrition rate 
of at least 20%: nine had more than 20% in both arms in 
at least one outcome (range: 10.0 to 65.5%), three had at 
least 45.0% per arm, and four had more than 20% (range: 
8.0 to 30.0%) in one arm with a differential rate of 9.0% 
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Records identified per database searched: 
COCHRANE (n = 362) MEDLINE (n = 1348) 
EMBASE (n = 1204) CINAHL (n = 1429) 



Records identified manually 
searching reference lists of included 
studies and key reviews (n = 440) 



Total records identified through database 
searching (n = 4343) 



Records after removing duplicates (n =4133) 



Records screened (n = 3960) 



Excluded 
(n = 3480) 



Records eligible for detailed evaluation (n =480) 



Excluded 
(n=217) 



Full-text examination (n = 263) 



Excluded 
{n = 219) 



Relevant for appraisal examination 
(n = 44) 



Excluded at 
data approbal 
(n = 12) 



Studies included in systematic review 
26 reported in 32 publications 



Figure 1 PRISMA Flow Diagram - study selection process. 



to 15.0% across the treatment and control groups. Only 
29.2% of the trials reported the use of intention to treat 
(ITT) techniques (type not always reported) to deal with 
missing data. 

Effectiveness of interventions 
Patient satisfaction with quality of care 

Patient satisfaction questionnaires were either validated 
[18,22,26,30], developed for the study purpose [15] or 
had unclear validation [17,23,37,39]. Meta-analysis of 
seven studies showed a significant increase in the mean 
satisfaction scores with nurse-led care (SMD 0.18, 95% 
CI 0.13 to 0.23) and significant heterogeneity between 
trials (I^ = 91%; xldf = 65.97; p < 0.00001) (Figure 2). 



Subgroup analyses by clinical characteristics showed that 
RN had a stronger effect than NPs in increasing patient 
satisfaction, although the pooled CIs became wider due 
to both the smaller number of studies and smaller sam- 
ple sizes (SMD 1.37, 95% CI 0.88 to 1.85). The effect es- 
timate also increased in studies of single contact care, 
urgent care visits and shorter (less than 6 months) 
follow-up episodes, but the significance of the findings 
did not change. On the other hand, the effect disap- 
peared in studies of on-going care, non-urgent care visits 
and longer (greater than 6 months) follow-up episodes. 
Subgroup analyses by study quality showed a more mod- 
est estimate with the same level of significance in larger 
trials, which are less prone to small study bias (N > 200: 



Table 1 Summary characteristics of participants and interventions of studies included in review 



study 



Location, first 
author, year 



Design, 
period* 



FUP, 
m 



Setting Participants Nurses' group 

Facilities, n Included diagnosis Nurses, 



Physicians' group 



Patients, 
N 



Mean 
age 
(SD), y 



Male, Phys., 



Patients, 
N 



Mean 
age (SD), 

y 



Intervention Outcomes reported 

Male, Nurses' training/ FCA GDL PS QoL HA M C 
% experience 



ZA 2 

Fairall, 2012 
[36], cohort 2 



ZA 1 

Fairall, 2012 
[36], cohort 1 



NL6 

Houweling, 
2011 [30] 



NL 5 

Kuethe, 201 1 
[25] 

RU 1 

Andryukhin, 
2010 [46] 



NL4 

Voogdt-Pruis, 
2010 [16] 



NL3 



cRCT, 
2008- 
2010. 



cRCT, 
2008- 
2010. 



RCT, 

period 

nr. 



RCT, 

2006- 

2008. 

RCT, 
2006 
-2009. 



RCT 

2006- 

2007. 



RCT 
2006. 



Nurse ART 
clinic, 31. 



16-18 Nurse ART 
clinic, 31. 



14 



12 



0.5 



Practice, 1. 



24 



6, 18 



Hospita 
outpatients, 
1; Practice, 
18. 

IVledical 
centre 
practice, 1. 



Healthcare 
centre, 6. 



HIV/AIDS. 



103 



HIV/AIDS. 



103 



Diabetes IVlellitus 
Type II. 



Asthma. 



Heart Failure with 
Preserved Fjection 
Fracture. 



10 



CVD, Hypertension, 6 
Hypercholesterolemia. 



Practice, 15; Common complaints. 12 
Reference, 5 



6415 



6159 



116 



36 



50 



817 



38 (8.9) 30 



36 (9.6) 33 



67.1 
(11) 



11.2 
(2.9) 



66.5 
(3.2) 



42.8 
(16.5) 



53 



64 



27 



38 



6479 



4923 



114 



71 (37', 
34*) 



50 



64 (9.0) 58 25 816 



50/17* 684 



i (9.63) 27 



35 (9.63) 31 



69.5 
(10.6) 



46.1 
(16.6) 



42 



11.2(2.5) 58 
'•; 10.1 
(2.6)* 



68 (4.3) 34 



64 (9.0) 62 



40 



Middle nurse 
managers trained 
to assume 
responsibility for 
ART and 
established 
patients' eligibility 
for ART. 

Middle nurse 
managers trained 
to assume 
responsibility for 
ART and 
established 
patients' eligibility 
for ART 

Practice nurse 
with one week 
training in 
diabetes mellitus; 
nurse had no 
special training in 
the treatment of 
diabetes prior to 
starting trial. 

Asthma nurse. 



Nurses with 
special degree in 
patient education 
obtained in a joint 
course. 

Advance practice 
nurse already 
employed to 
manage patients 
with asthma, 
chronic 
obstructive 
pulmonary 
disease, or 
diabetes. 

Nurse practitioner 
with IVlaster 



yes 



yes 



yes yes 



yes 



yes 



yes 



yes J ■/ 



Table 1 Summary characteristics of participants and interventions of studies included in review (Continued) 



Dierick-Van 
Dale, 2009 [39] 



UK 9 

Chan, 2009 
[42] 

NL2 

Du Moulin, 
2007 [37] 

US 6 

Hiss, 2007 [32] 
NL 1 

Hesselink, 2004 
[33] 



UK 8 

Denver, 2003 
[40] 

UK 7 

Jarman, 2002 
[29] 

UK 6 

Kernick, 2002 
[27] 

US 5 

IVlundinger, 
2000 [22,24] 

UK 5 Kernick 
2000 [28] 



UK 4 

Kinnersley, 
2000 [26] 



UK 3 



RCT, 

2002- 

2004. 

CRCT, 

period 

nr. 

RCT, 

period 

nr. 

RCT, 

2000- 

200T 



RCT, 

2000- 

200T 

RCT 

1996- 

1999. 

RCT 

period 

nr. 

RCT 

1995- 

1997. 

RCT 
period 



RCT 
period 



RCT 
period 



6-12, 
24 



Nurse clinic, GORD, moderate 
1 Gastritis. 



12 nr. 



All forms of 1 
incontinence. 



Community, Diabetes Mellitus. nr 
2; PHD, 1. 



24 Practice, 12 Asthma and COPD. 2 



Nurse clinic 

hospita 
based. 



Diabetes IVIellitus 
Type II pre-diagnosed 
with Hypertension or 
in receipt of BPLT 



24 Practice, 438. Parkinson's Disease. 



Health 
Centre 
Practice, 1 

Community 
clinic, 4; 
Primary care 
clinic, 1. 

Health 
Centre, 1 



Asthma 



Asthma, Diabetes 7 
Mellitus, 

Hypertension, or 
urgent visits. 

Psoriasis and Eczema. 1 



0.5-1 Practice, 10 Diverse complaints. 



0.5 Practice, 20 



Diverse complaints: 
e.g. minor injuries. 



38 



95 



139 



12 



20 (1 
per 

practice) 



60 



1041 



55 



1181 



55 



1465 II 



651 



50.2 49 
(13.9) 



51 0 
(13.0) 



58.1 

(i3.b: 



Median 56 
(IQR): 35 
(29-47) 

44 24 



474 39 
(18.4) 



range: 0 39 
to >75 



42 



28" 1 3 



55.7 32 108' 102 
(13.1) 

49.9 35 14 137 
(14.2) 



57 nr 



57 



10 



60 



54 



146511 



665 



484 49 
(12.8) 



51 (13.0) 0 



57(11.4) 35 



44.7 28 
(13.6) 



624(9.1) 70 



nr 56 



Median 33 
(IQR): 37 
(27-50) 

44.9 22 



51.7 
(15.8) 



range: 42 

0->75 



nr 43 



degree in Advance 
Nursing trained in 
common 
complaints. 

Gastrointestina 
nurse practitioner. 



Registered nurse 
specialist in 
incontinence. 

Diabetes nurse. 



GP assistant with 
pre- and during- 
trial training to 
deal with the dif- 
ferences between 
asthma and COPD. 

Hypertension 
nurse. 



Community nurse 
with a course in 
Parkinson Disease. 

Nurse with 
structured training 
in Asthma care. 



yes 



yes 



yes 



yes 



yes 



yes 



Practice nurse 
with training in 
psoriasis and 
eczema 
management. 

Nurse practitioner 
with diploma on 
care for same day 
consultations for 
primary care. 

Nurse with course 
at BSc or MSc 
level. 



yes 



Community nurse no nr J J 
practitioner. 
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Venning, 2000 
[17] 

UK 2 

Shum, 2000 
[18] 



US 4 

Hemani, 1999 
[34] 



RCT, 

1998- 

1999. 



qRCT, 
1999- 
2001. 



UK 1 RCT, 
1995- 

Campbell, 1998 ^ggg 
[19-21,41,43-45] 



US 3 

Winter, 
[15] 



981 



RCT, 
1980. 



US 2 

Flynn, 1974 
[35] 

US 1 

Lewis, 1967 
[23] 



RCT, 
1971. 



RCT, 

period 

nr. 



respiratory 
complaints. 

0.5 Practice, 5 Acute minor illnesses. 5 



12 



12, 
24, 
56.4 
122.4 



05 



Primary care 
veterans 
affair clinic, 
1 



Practice, 1 9 



Community 
clinic, 1 



>6 



12 



HO clinic, 1; 
Private, 3 



University 
Hospita 
clinic, 1; 
Nurse clinic, 
1 



Undifferentiated 
conditions. 



Coronary Heart 
Disease secondary 
prevention. 



Family planning, 
venereal diseases, 
acute non-traumatic 
minor illnesses. 



28 



900 



150 



673 



median 40 

(IQR):26 

(9-41.8) 



62 



99 



58 



19 



45 



915 



300 



670 



median 

(IQR):29.1 

(9.7-44.9) 



40 



66.3 (8.2) 58 



(8.2) 



25 



25 



Undifferentiated. 



Hypertension, CVD, 
Obesity, Arthritis, 
Somatization. 



40 



33 



range: 12 
16-78 



20 



33 



range: 
16-83 



12 



Practice nurse 
with a course in 
minor illnesses 
and piloted before 
study. 

Nurse practitioner 
who was on staff 
for at least six 
months in primary 
care. 

District and 
practice nurses 
trained in clinic 
protocols/GDLs for 
behavioural 
techniques 
change. 

Registered 
professional nurse 
with preparation 
and skills in 
physical diagnosis, 
psychosocia 
assessment, and 
health-illness man- 
agement in pri- 
mary care. 

Nurse clinicians 
with training in 
service delivery. 

Nurses who 
provided primary 
source care for at 
least one year 
before the study. 



yes yes 



yes 



yes 



yes 



yes ■/ •/ 



Legend. 

Studies are listed by year (y) of publication, in decreasing order; and labelled after the country where they were conducted. 

US, United States; NL, The Netherlands; UK, United Kingdom; ZA, South Africa; RU, Russia; RCT, Randomised Controlled Trial; cRCT, cluster Randomised Controlled Trial; qRCT, quasi-Randomised Controlled Trial; PUP, 

follow-up episodes are reported in months (m); nr, not reported; ART, Antiretroviral Therapy; PHD, public health department; HO, Hospital Outpatients; HIV, Human Immunodeficiency Virus; CVD, Cardiovascular Disease; 

GORD, Gastro-Oesophageal Reflux Disease; COPD, Chronic Obstructive Pulmonary Disease; BPLT, Blood Pressure Lowering Treatment; SD, standard deviation; IQR, Interquartile Ranges; 

FCA, full clinical autonomy; GDL, interventions based on clinical guidelines or protocols; PS, patient satisfaction; QoL, quality of life; HA, hospital admissions; M, mortality; C, costs. 

*Start and end year when studies were conducted. 

^Reference practices for comparison on economic/cost data. 

^Paediatricians. 

^General physicians. 

"Number of randomized patients per group not reported. 
^Nine were physicians and nineteen were supervisors. 
*Sixty-three were for the control group. 
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Table 2 Quality of methods In the studies included in review 



Study details (country. Inclusion & Outcome Sequence Allocation Blinding Sample Attrition Funding 

design, funding) exclusion criteria 2ry generation concealment size % 

ZA 2 ✓ ✓ ✓ A A NP* /II >20* G 

Fairall, 2012 [36] (Cohort 2) 

ZA 1 y ✓ A A NP* /ll >20» G 

Fairall, 2012 [36] (Cohort 1) 

NL6 / ✓ i A NP / <20 G 

Houweling, 201 1 [30] 

NL5 ✓ A A NP <20 NR 

Kuethe, 201 1 [25] 

RU 1 ✓ U I * >20 None 

Andryukhin, 2010 [46] 

NL4 ✓ ✓ A U 1*'^ / <20 P/lnd. 

Voogdt-Pruis, 2010 [16] 

NL3 A A NP NP >20 G 

Dierick-Van Dale, 2009 [39] 

UK 9 A A NP^ y <20* NR 

Chan, 2009 [42] 

NL2 y y u u NP yll >2o nr 

Du Moulin, 2007 [37] 

USA 6 * U U NP NP <20 G 

Hiss, 2007 [32] 

NL 1 * y y u u NP'' yll >2o nr 

Hesselink 2004 [33] 

UK 8 * y y I I NP y' <20» nr 

Denver, 2003 [40] 

UK 7 y y y A a np y <2o p/ind. 

Jarman, 2002 [29] 

UK 6 ""^yyA uuy >20* nr 

Kernick, 2002 [27] 

US 5 * U U NP y' >20 G 

Mundinger, 2000 [22,24] 

UK 5 y y A U U y' >20 Ind. 

Kernick, 2000 [28] 

UK 4 y y y A a np y^'' >2o g 

Kinnersley, 2000 [26] 

UK 3 y A A NP NrII'' >20 P 

Venning, 2000 [17] 

UK 2 y A A NP y' >20 G 

Shum, 2000 [18] 

US 4 y I I NP NP U NR 

Hemani, 1999 [34] 

UK 1 y A I NP'' y >20* G 

Campbell, 1998 [19-21,41,43-45] 

US 3 y y U U A* NR U NR 
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Table 2 Quality of methods in the studies included in review (Continued) 

Winter, 1981 [15] 
US 2 

Flynn, 1974 [35] 
US 1 

Lewis, 1 967 [23] 
Legend. 

Studies are listed by year (y) of publication, in decreasing order. Blinding: wfietfier patients, care providers and outcome assessors were blinded. Attrition of more 
than 20% is of significant concern. Intention to treat (ITT) whether study authors analysed all patients based on their original group allocation regardless of 
protocol violations or non-compliance. US, United States; NL, The Netherlands; UK, United Kingdom; ZA, South Africa; RU, Russia; I, Inadequate; A: Adequate; 
U, Unclear; NP, Not Performed; NR, Not reported; Funding, Government (G), Industry (Ind.) or Private (P) grant. 
*Only the inclusion criteria was reported. 

^Not all factors tested at baseline were comparable between groups. 

*Fairall et al. (2012) [36] partly blinded data analysts; Andryukhin et al. (2010) [46] blinded clinicians not patients; Voogdt-Pruis et al. (2010) [16] blinded patients 

not clinicians; Winter (1981) [15] blinded patients and clinicians. 

^Outcome assessors blinded for some or all outcomes. 

^Used a cluster effect approach (e.g. Huber-White). 

^Reached the least target sample required to achieve power. 

''Used ITT strategies to deal with missing data. 



U U NP NR <20 NR 

U U NP*" NR U* G 



study/Subgroup RCTs, Nurses, Physicians, IV, Fixed Weigtit SIV1D(95%CI) Heterogeneity Oueraii 





n 


Mean (SD) 


N 


Mean (SD) 


N 






% 




Chi' 


df 


P 




effect, P 


A) all trials 






























IVIundinger, 2000 




4.45 (0.89) 


644 


4.46 (0.89) 


389 


H 


1- 


17.3 


-0.01 (-0.14 to 0.11) 












Dierick-van Daele, 2009 




8.19 (1.18) 683 


8.2 (1.26) 


609 


i 


i 


23.0 


-0.01 (-0.12 to 0.10) 












Shum, 2000 




78.6 (16) 


635 


76.4 (17.8) 


657 




■ 


23.0 


0.13 (0.02 to 0.24) 












Venning, 2000 




4.4 (0.5) 


608 


4.22 (0,5) 


571 




■ 


20.7 


0.36 (0,24 to 0.47) 












Kinnersley, 2000 




77.9 (9.5) 


334 


74.0 (9.2) 


596 




■ 


15.0 


0.42 (0.28 to 0.55) 












DuMoulin, 2007 




8.7 (1) 


35 


7.5 (1) 


10 






0.5 


1.18 (0.43 to 1.93) 












Winter, 1981 




94.78 (9) 


25 


74.03 (17) 


25 






— 0.7 


1,50 (0,87 to 2.14) 












Overall (95% CI) 


7 




2964 




2857 






100 


0.18 (0.13 to 0.23) 


65.97 


6 


<0.00001 


91 


<0.00001 


B) subgroup 






























Employed nurses 






























NP 


3 




1613 




1642 




♦ 




0.16 (0.09 to 0.23) 


21.42 


2 


<0.0001 


91 


<0.00001 


NP+ 


2 




1291 




1180 




♦ 




0.17 (0.09 to 0.25) 


20.65 


1 


<0.00001 


95 


<0.0001 


RN 


2 




60 




35 








1.37 (0.88 to 1.85) 


0.42 


1 


0.52 


0 


<0.00001 


Type of care 






























single contact 


5 




2285 




2458 




♦ 




0.21 (0.16 to 0.27) 


48.99 


4 


<0.00001 


92 


<0.00001 


ongoing care 


2 




679 




399 








0.02 (-0.10 to 0.15) 


9.47 


1 


0.002 


89 


0.73 


urgent care visits 


4 




2221 




2213 




♦ 




0.22 (0.16 to 0.28) 


29.52 


3 


<0.00001 


90 


<0.00001 


non-urgent visits 


3 




743 




644 


■ 






0.06 (-0.05 to 0.17) 


29.99 


2 


<0.00001 


93 


0.28 


Follow-up 






























<6 months 


5 




2285 




2458 




♦ 




0.21 (0.16 to 0.27) 


48.99 


4 


<0,00001 


92 


<0.00001 


>6 months 


2 




679 




399 








0.02 (-0.10 to 0.15) 


9.47 


1 


0.002 


89 


0.73 


Study size 






























NS200 


2 




60 




35 








1.37 (0.88 to 1.85) 


0.42 


1 


0.52 


0 


<0.00001 


N>200 


5 




2904 




2822 








0,16 (0,11 to 0.22) 


42,08 


4 


<0,00001 


90 


<0.00001 


Risk of bias* 






























adequate concealment 


4 




2260 




2433 




♦ 




0.20 (0.14 to 0.26) 


32.96 


3 


<0,00001 


91 


<0.00001 


inadequate concealment 


3 




704 




424 


■ 






0.08 (-0.05 to 0.20) 


29.66 


2 


<0,00001 


93 


0.22 



-2-1 0 12 



Setter with PLC Better with NLC 

Figure 2 Effects of physician-nurse substitution on patient satisfaction in A) all trials and by B) subgroups. Legend. CI, confidence 
interval; df, degrees of freedom; N, total number of patients; SMD, standard mean differences; SD, standard deviation; Chi^, statistical test for 
heterogeneity; P, p-value of Chi^ (evidence of heterogeneity of intervention effects); 1^, amount of heterogeneity between trials; Overall P, p-value 
for significance of effects of interventions; NLC, Nurse-Led Care; PLC, Physician-Led Care; NP, Nurse Practitioner; NP+, Nurse Practitioner with 
higher degree/courses/specialisation; RN, Registered Nurse. All trials had >20% attrition in at least one arm. 
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study/Subgroup 



Nurses 
N/Total 



Physicians 
N/Total 



IV, Fixed 



Weight RR(95%CI) 



Heterogeneity Overall 
' df P \',% effect, P 



A) all trials 

Shum, 2000 
Campbell, 1998 
Andryukhin, 2010 
Mundinger, 2000 
Flynn, 1974 

Overall (95% CI) 5 

B) subgroups 
Employed nurses 

NP 3 
RN/LN 2 

Type of care 

single contact 3 

ongoing care 2 

urgent care visits 2 

non-urgent visits 3 

Follow-up* 

<12 months 4 
>12 montlis 3 

Study size 

NS200 2 

N>200 3 
Risk of bias 

adequate concealment 1 

inadequate concealment 4 

<20% attrition 1 

>20% attrition 4 



1/6S4 
106/540 
9/44 
68/800 
10/40 
194/2108 



175/2024 
19/84 

79/1524 
115/584 
69/1484 
125/624 

48/1568 
183/1384 

19/84 
175/2024 

1/684 
193/1424 

10/40 
184/2068 



3/694 
145/518 
10/41 
50/509 
2/20 
210/1782 



198/1721 
12/61 

55/1223 
155/559 
53/1203 
157/579 

42/1264 
205/1068 

12/61 
198/1721 

3/694 
207/1088 

2/20 
208/1762 



0.6 0.34 (0.04 to 3.24) 

66.3 0.70 (0.56 to 0.87) 
5.1 0.84 (0.38 to 1.85) 

26.4 0.87 (0.61 to 1.23) 
1.6 2.50 (0.60 to 10.34) 
100 0.76 (0.64 to 0.91) 



0.74 (0.62 to 0.89) 



0.90 (0.64 to 1.26) 
0.71 (0.57 to 0.88) 
0.85 (0.60 to 1.19) 
0.73 (0.59 to 0.90) 

0.74 (0.49 to 1.12) 
0.75 (0.63 to 0.90) 

1.09 (0.54 to 2.17) 



0.34 (0.04 to 3.24) 
0.76 (0.64 to 0.91) 
2.50 (0.60 to 10.34) 
0.75 (0.62 to 0.89) 



4.30 


4 


0.37 


7 


0.003 


1.47 


2 


0.48 


0 


0.001 


1.73 


1 


0.19 


42 


0.81 


2.76 


2 


0.25 


27 


0.53 


0.18 


1 


0.67 


0 


0.002 


0.65 


1 


0.42 


0 


0.34 


3.13 


2 


0.21 


36 


0.003 


3.94 


3 


0.27 


24 


0.16 


1.09 


2 


0.58 


0 


0.002 


1.73 


1 


0.19 


42 


0.81 


1.47 


2 


0.48 


0 


0.001 


NA 








0.35 


3.81 


3 


0.28 


21 


0.003 


NA 








0.21 


1.56 


3 


0.67 


0 


0.001 



Lower with NiC Lower witt) PLC 

Figure 3 Effects of physician-nurse substitution on hospital admissions in A) all trials and by B) subgroups. Legend CI, confidence 
interval; df, degrees of freedom; N, number of patients with events; Total, total number of patients per group; RR, Relative Risk; Chi^ statistical test 
for heterogeneity; P, p-value of Chi^ (evidence of heterogeneity of intervention effects); 1^, amount of heterogeneity between trials; Overall P, 
p-value for significance of effects of interventions; NLC, Nurse-Led Care; PLC, Physician-Led Care; NP, Nurse Practitioner; NP+, Nurse Practitioner 
with higher degree/courses/specialisation; RN, Registered Nurse. Two RCTs provided data for different follow-up episodes and were incorporated 
accordingly: Andryukhin et al. (2010) [46] reported data at 6 and 18 months and Mundinger et al. (2000) [22,24] reported data at 6 and 12 months. 



SMD 0.16, 95% CI 0.11 to 0.22; N < 200: SMD 1.37, 95% 
CI 0.88 to 1.85). The effect was not significant in trials 
with inadequate allocation concealment. All trials had at 
least 20% attrition. Heterogeneity disappeared in the sub- 
group of registered nurses and smaller trials (N < 200). 
Two other trials with qualitative data reported signifi- 
cantly higher patient satisfaction scores with nurse-led 
care [23,30]. 

Hospital admissions 

Five trials had sufficient data for meta-analysis (Figure 3), 
two of which reported different follow-up episodes 
[22,46]. The pooled RR showed a significant reduction 
in the risk of all-cause hospital admissions with nurse- 
led care (RRs 0.76, 95% CI 0.64 to 0.91) and no signifi- 
cant heterogeneity between trials (I^ = 7%; xidf = 4.30; 
p = 0.37). Subgroup analyses by clinical characteristics 
showed that NPs had a positive effect in reducing all- 
cause admissions to hospital (RRs 0.74, 95% CI 0.62 to 



0.89) while the effect was not significant with RNs. The 
estimate increased in studies of on-going care, non- 
urgent visits and longer (at least 12 months) follow-up 
episodes. The effect disappeared in trials of single con- 
tact care, urgent care and shorter (less than 12 months) 
follow-up episodes. Subgroup analyses by study quality 
showed that in large trials (less prone to bias) nurse-led 
care had an increasingly significant effect in reducing 
hospital admissions (N < 200: RR 1.09, 95% CI 0.54 to 
2.17; N > 200: RR 0.74, 95% CI 0.62 to 0.89). However, 
trials that were of higher quality in other ways (e.g. better 
allocation concealment and less attrition) tend to show 
the opposite effect with better quality being associated 
with higher rates of admissions with nurse-led care, albeit 
non-significant. Heterogeneity remained non-significant 
across subgroups and disappeared in studies of nurse 
practitioners, on-going and urgent care, longer follow-up 
episodes, larger trials and trials with at least 20% attrition. 
In addition, data that were not pooled showed less 
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study/Subgroup 



RCTs, Nurses 



Physicians 



IV, Fixed 







N/Total 


N/Total 


A) all trials 








Hiss, 2007 




0/95 


3/102 


Voogdt-Pruis, 2010 




1/314 


7/387 


Shum, 2000 




0/684 


2/694 


Hesselink, 2004 




0/115 


1/94 


Denver, 2003 




0/60 


1/60 


Hemani, 1999 




3/150 


9/300 


Campbell, 1998 




100/673 


128/670 


Jarman, 2002 




353/1016 


307/803 


Fairall, 2012) 




997/4943 


747/3407 


Andryukhin, 2010 




2/44 


2/41 


Overall (95% CI) 


10 


1456/8094 


1207/6558 


B) subgroups 








Employed nurses 








NP 


3 


103/1507 


139/1664 


NP+ 


3 


1/469 


11/549 


RN/LN 


4 


1352/6118 


1057/4345 


Type of care 








single contact 


2 


3/834 


11/994 


ongoing care 


8 


1453/7260 


1196/5564 


urgent care visits 


1 


0/684 


2/694 


non-urgent visits 


9 


1456/7410 


1205/5864 


Follow-up* 








<12 months 


4 


1/883 


8/897 


>12 months 


7 


1456/7255 


1201/5702 


Study size 








N<200 


4 


5/349 


15/503 


N>200 


6 


1451/7745 


1192/6055 


Risk of bias 








adequate concealment 


3 


1350/6643 


1056/4904 


inadequate concealment 


7 


106/1451 


151/1654 


<20% attrition 


4 


354/1485 


318/1352 


>20% attrition 


6 


1102/6609 


889/5206 



Weight 

% 



RR(95%CI) 



Chi 



Heterogeneity Overall 
' df P \',% effect, P 



♦ 

♦ 
♦ 

♦ 



0,2 
0.5 
0.2 
0.1 
0.1 
0.4 
9.3 
24.9 
64.1 
0.2 
100 



0.15 (0.01 to 2.93) 
0.18 (0.02 to 1.42) 
0.20 (0.01 to 4.22) 
0.27 (0.01 to 6.62) 
0.33 (0.01 to 8.02) 
0.67 (0.18 to 2.43) 
0.78 (0.61 to 0.99) 
0.91 (0.80 to 1.03) 
0.92 (0.85 to 1.00) 
0.93 (0.14 to 6.31) 



0.89 (0.84 to 0.96) 


7.52 


9 


0.58 


0 


0.001 


0.76 (0.60 to 0.96) 


0.8 


2 


0.67 


0 


0.02 


0.19 (0.04 to 0.85) 


0.15 


2 


0.93 


0 


0.03 


0.92 (0.85 to 0.98) 


0.58 


3 


0.9 


0 


0.01 


0.53(0.17to 1.70) 


0.51 


1 


0.48 


0 


0.29 


0.90 (0.84 to 0.96) 


6.38 


7 


0.5 


0 


0.001 


0.20 (0.01 to 4.22) 


NA 








0.3 


0.90 (0.84 to 0.96) 


6.59 


8 


0.58 


0 


0.001 


0.26 (0.07 to 1.06) 


0.4 


3 


0.94 


0 


0.06 


0.90 (0.84 to 0.96) 


4.82 


6 


0.57 


0 


0.002 


0.54 (0.21 to 1.36) 


1.21 


3 


0.75 


0 


0.19 


0.90 (0.84 to 0.96) 


5.54 


5 


0.35 


10 


0.002 


0.92 (0.85 to 0.98) 


0.97 


2 


0.61 


0 


0.01 


0.73 (0.58 to 0.91) 


3.83 


6 


0.7 


0 


0.006 


0.89 (0.78 to 1.00) 


4.18 


3 


0.24 


28 


0.05 


0.90 (0.83 to 0.97) 


3.37 


5 


0.64 


0 


0.008 



0.01 



100 



0.1 1 10 

Lower with NLC Lower with PLC 

Figure 4 Effects of physlclan-nurse substitution on mortality in A) all trials and by B) subgroups. Legend. CI, confidence interval; df degrees of 
freedom; N, number of patients with events; Total, total number of patients per group; RR, Relative Risk; Chi^, statistical test for heterogeneity; 
P, p-value of Chi^ (evidence of heterogeneity of intervention effects); 1^, amount of heterogeneity between trials; Overall P, p-value for significance of 
effects of interventions; NLC, Nurse-Led Care; PLC, Physician-Led Care; NP, Nurse Practitioner; NP+, Nurse Practitioner with higher degree/courses/ 
specialisation; RN, Registered Nurse. Andryukhin et al. (2010) [45] reported data at 5 and 18 months and was incorporated accordingly. 



hospital admissions with nurse-led care [23,36] or no 
significant differences between groups [22,44,36] (see 
Additional file 1: Table S7). Qualitative data reported less 
hospital admissions with nurse-led care at 24 months [25] 
or no significant differences between groups at 1 month 
[26] or 12 months [34]. 

Mortality 

Ten trials had sufficient data for meta-analysis, one of 
which reported different follow-up episodes [46] (Figure 4). 
The pooled RRs showed a significant reduction in the risk 
of all-cause mortality with nurse-led care (RRs 0.89, 95% 
CI 0.84 to 0.96) and no significant heterogeneity between 
trials (I^ = 0%; xidf = 7.52; p = 0.58). Subgroup analyses 
by clinical characteristics showed that NPs had an in- 
creased effect but less significant than RN/LN in reducing 
all-cause mortality (NP: RRs 0.76, 95% CI 0.60 to 0.96; 



RN/LN: RRs 0.92, 95% CI 0.85 to 0.98). Although NPs+ 
showed an increased estimate, the CIs were wide and less 
significant (RR 0.19, 95% CI 0.04 to 0.85). The estimate in- 
creased in studies of on-going care, non-urgent visits and 
longer (at least 12 months) follow-up episodes but the CIs 
and significance remained the same. The effect disap- 
peared in trials of single contact, urgent care visits (n = 1) 
and shorter (less than 12 months) follow-up episodes. The 
estimate increased, although with wider CIs and less sig- 
nificance, in trials with inadequate allocation concealment 
(RRs 0.73, 95% CI 0.58 to 0.91) and in trials with at least 
20% attrition (RRs 0.90, 95% CI 0.83 to 0.97). On the other 
hand, the estimate decreased, with reduced significance, in 
trials of adequate concealment and trials of less than 
20% attrition and disappeared in small trials (N < 200, 
RR 0.54, 95% CI 0.21 to 1.36). Heterogeneity between 
trials remained non-significant in all subgroups 
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Scale / Study 



Nurses Physicians 
patients, N patients, N 



SIVID(95%CI) 



Houweling, 2011 


0.15 


0.1505 


85 


93 


IVlundinger, 2000 


0.02 


0.063S 


220 


165 


IVIundinger, 2000 


-0.01 


0.1046 


222 


184 


IVIundinger, 2000 


0.02 


0.0995 


649 


391 


SF-12 










Lnan, zuuy 


0.7 


0.1556 


89 


86 


Euroqol SD (0=dead,l=full health) 










Diericl<-van Daele, 2009 


0.06 








Euroqol 5D (-0.59=worst,l= best) 










Jarman, 2002 


-0.06 


0.0561 


696 


558 


Euroqol 5D by VAS 










DuMoulin, 2007 


0.12 


0.3597 


35 


10 


Kernick, 2000 


0.26 


0.227 


35 


46 


Dermatology Life Quality Index 










Kernick, 2000 


-0.32 


0.2245 


35 


46 


Clinical score for Eczema and Psoriasis 










Kernick, 2000 


-0.15 


0.2245 


35 


46 


Global General Questionnaire for Parl<inson's Disease 








Jarman, 2002 


0.16 


0.0561 


696 


558 


PQ-39 for Parkinsons' Disease 










Jarman, 2002 


-0.02 


0.0561 


696 


558 


Respiratory Illness Questionnaire (RIQ) 










Hesselink, 2004 


0.21 


0.1531 


115 


94 


Hesselink, 2004 


0.06 


0.1403 


96 


80 


Minnesota Living with Heart Failure Questionnaire 








Andryukhin, 2010 


1.3052 


0.6819922 


40 


35 


Asthma related QoL instrument (ARQoL) 










Kernick, 2002 


0.6396 


0.6921 


55 


46 


Kernick, 2002 


1.037 


1.0878 


55 


46 



0.15 (-0.14 to 0. 
0.02 (-0.11 to a 
-0.01 (-0.22 too. 
0.02 (-0.18 to 0, 

0.70 (0.40 to 1 

0.06 (-0.08 to 0, 

-0.06 (-0.17 too. 

0.12 (-0.58 to 0, 
0.26 (-0.18 to 0, 

-0.32 (-0.76 too. 

-0.15 (-0.59 too. 

0.16 (0.05 too 

-0.02 (-0.13 too. 

0.21 (-0.09 to 0, 
0.06 (-0.21 to 0, 



-4 -2 
Better with PLC 



,44) 
.15) 
20) 
22) 

,00) 

,20) 

05) 

,82) 
,70) 

12) 

29) 

.27) 

09) 

,51) 
,33) 



1.31 (-0.03 to 2.64) 



0.64 (-0.72 to 2, 
1.04 (-1.10 to 3, 



,00) 
17) 



2 4 

Setter with NLC 



Figure 5 Comparison of individual trial estimates of the effect of physician-nurse substitution on Quality of Life. Legend. A { 
estimate was not possible due to the various scales used, grading scores and measurements. SMD, standard mean difference; SE, standard ( 
N, total number of patients per group; CI, confidence interval; NLC, Nurse-Led Care; PLC, Physician-Led Care. 



although low heterogeneity was introduced in trials with 
less than 20% attrition and smaller trials. The funnel 
plot was asymmetrical showing five trials falling to the 
left (nurse-led care with fewer events), two on the right 
and three on the line of no effect. Data that could not 
be pooled showed a significantly lower cumulative rate 
of all-cause mortality and a marginal significance in the 
cumulative rate of mortality due to coronary/non-fatal 
myocardial infarction with nurse-led care at 56.4 
months [44] (Additional file 1: Table S7). Qualitative 
data reported to have no documentation of death after 
12 months follow-up [36]. 

Sensitivity analyses 

In the meta-analyses (Figures 2, 3 and 4), excluding the 
studies in which nurses had full clinical autonomy or from 
which this information was not obtainable did not critically 
alter the estimates (Additional file 1: Table S6). The small 



non-significant amount of heterogeneity in the meta- 
analysis of hospital admissions was attributable to a small 
study which favoured physician-led care but had wide CIs. 
Excluding quasi-RCTs or cluster RCTs from the meta- 
analyses of patient satisfaction and mortality slightiy re- 
duced the pooled estimate but did not alter the direction of 
effects and the findings remained significant. 



Random-effects meta-analyses 

Meta-analyses using a RE model showed the same direc- 
tion of effect. The pooled estimates and heterogeneity 
remained significant. Patient satisfaction showed an in- 
creased estimate although wider CIs (SMD 0.31, 95% CI 
0.12 to 0.514, p = 0.002; f = 91%; xldf = 65.97; p < 0.0001). 
Hospital admissions (RRs 0.77, 95% CI 0.63 to 0.94, p = 
0.01; I^ = 7%; xldf = 4.30; p = 0.37) and mortality (RRs 0.90, 
95% CI 0.84 to 0.96, p = 0.002; f = 0%; 9df = 7.52; 
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p = 0.58) showed very similar estimates of a more mod- 
est but yet significant effect. 

Quality of life 

Four [27-29,37] of the thirteen [17,22,23,27-30,33,37, 
39,42,44,46] trials with measures on QoL used both 
disease-specific and generic scales of functional health and 
well-being. Other seven [17,22,23,30,39,42,44] used only 
generic scales and two used only disease-specific scales 
[33,46]. Due to the different scales, grading scores and mea- 
surements, we decided not to combine trials in a pooled 
analysis (Figure 5). Comparison of the individual estimates 
of trials using generic scales showed nurse-led care signifi- 
cantly improved QoL scores with the SF-12 at 6 months 
(SMD 0.70, 95% CI 0.40 to 1.00) and with the Global Gen- 
eral Questionnaire for Parkinsons Disease at 24 months 



(SMD 0.16, 95% CI 0.05 to 0.27). Estimates from trials using 
the SF-36 and Euroqol did not reach significance although 
some favoured nurse-led care. Trial estimates [27,33,46] 
using disease-specific scales at 4, 6, 12, and 24 months 
favoured nurse-led care but were not significant. Four trials 
reported better scores with nurse-led care in various indi- 
vidual dimensions of the ARQoL, SF-36 and RIQ question- 
naires but the overall score was not significant at two weeks 
[45] or not sustained at least 12 month thresholds 
[27,33,44] except for patients with incontinence for whom 
better scores of individual dimensions at 6 months persisted 
at 12 months or reached a significant overall score (re- 
ported p < 0.05) [37]. Qualitative data based on generic 
scales reported significance (general health questionnaire) 
[23] or non-significance (SF-36, EQ5D-VAS) in the overall 
score at 0.5 [17] or at 4 [27] months. 
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Costs 

There were six trials [17,23,29,35,42,44] with data on cost 
and two [31,38] comprehensive economic evaluations. 
Due to the large variety of approaches used to value the 
resources and calculate cost we didn't pool trials in a 
meta-analysis. Figure 6 shows the comparison of the indi- 
vidual trial estimates. Costs were generally lower with 
nurse-led care in direct costs including consultations 
within study practices, for all patients and in patients not 
yet 65 years old, in study practices (compared to external 
reference practices) [38] at 0.5 or 12 [23] months, and in 
treatment costs with both unadjusted and adjusted data at 
6 months [42]. On the other hand, the mean cost per 
quality adjusted life years (QALYs) at the end of 56.4 
months and the cost of interventions (clinics and drugs) 
were significantly higher with nurse-led care in one trial 
[44]. Another trial showed lower costs with nurse-led 
care based on face-to-face total cost of clinicians (total 
consultation time without the time to get prescriptions 
signed by physicians or time taken to sign a prescription) 
[17]. The studies also showed no significant differences 
between nurses and physicians in direct and productivity 
costs for consultations in all patients at study practices 
[38], direct and productivity costs for consultations in all 
patients or for patients not yet 65 years old at study prac- 
tices (compared to external reference practices) [38], in 
the costs of care based on either the total time or face-to- 
face time given by the nurse or physician [17] or other 
healthcare system costs (hospitals, outpatient attendances 
and admissions to private hospitals) [44] . 

Other trials reported lower healthcare costs with nurse- 
led care at 6 to 56.4 months [23,35,44] and no significant 
differences between groups in net healthcare costs [29]. 

Discussion 

Substitution of physicians by nurses is often discussed 
and widely practiced in many countries, with the aim of 
satisfying the demands of an aging population and 
(local) shortages of physicians. Our review showed that 
the volume of rigorous evaluations is slowly increasing 
but remains low. In addition, the quality of available re- 
search does allow strong recommendations for practice 
and policy, despite previous proposals [6,7]. 

In the appraised literature, the nurses assessed a wide 
variety of conditions and performed various tasks, with 
different degrees of clinical autonomy and in different 
settings. Despite this heterogeneity and the substantial 
methodological limitations, our review suggested that 
nurse-led care is associated with higher patient satisfac- 
tion, lowered overall mortality and lowered hospital ad- 
missions. Effects on other outcomes, such as QoL and 
costs remained inconclusive. 

The effect of nurse-led care on hospital admissions 
and mortality was particularly present in studies of on- 



going care and non-urgent visits and when nurse practi- 
tioners (both NP and NP with higher degree/courses) 
provided the care. This suggests that trained nurses can 
effectively provide healthcare to patients with established 
diseases. However, the effect disappeared (for hospital 
admissions) or weakened (for mortality) in studies with 
better or adequate concealment of allocation and in lar- 
ger studies. The reasons for this surprising and import- 
ant finding, especially that nurse-led care could lead to 
reduced mortality, should be addressed in future studies. 

Our overall results also showed a highly significant ef- 
fect of nurse-led care on patient satisfaction although 
with severe heterogeneity between trials. This finding is 
consistent with previous reviews [3,4]. Nevertheless, this 
result should be interpreted with caution. Although the 
average effect is positive, subgroups of patients reported 
less positive views. Our results suggest this variability 
may be due to nurses' roles or study size, which may be 
associated with other factors (such as degree of clinical 
autonomy). The effect disappeared when we considered 
only the trials based on on-going care or non-urgent 
care, and in trials with longer follow-up episodes (at 
least 6 months), but these subgroups included two trials 
only. Surprisingly, patient satisfaction was higher with 
general nurses (as compared to NPs or NPs with higher 
degree/extra courses), but the two very small studies 
showing this effect addressed tasks for very special con- 
ditions such as incontinence and family planning. This 
finding fits in with previous research which showed that 
patients appreciate nurses' involvement especially in 
education and counselling [47,48]. 

The results on QoL were difficult to interpret due to 
heterogeneous reporting of outcomes and the data that 
were scattered across different scales with outcome 
measurements at variable follow-up time intervals. 
Only a few trials used both generic and disease-specific 
scales with primarily one trial per scale. There was a 
potential increase in QoL scores with nurse-led care, 
when health status was evaluated using generic scales, 
or for specific conditions (e.g. heart failure, Parkinson's 
Disease) but the effect was not significant or not sus- 
tained at length (at least 12 months) or it was contra- 
dicted by data from the same studies [28]. Similarly, 
there were some effects of lower costs with nurse-led 
care, but the reported data used different approaches to 
value the resources and to calculate costs in only a few 
trials and economic evaluations. 

Methodological appraisal of Included studies 

We identified several significant limitations in the 
current evidence which should be considered in future 
research. The trials included were highly heterogeneous 
in terms of tasks, settings, collection and reporting of 
outcome measurements. There is a considerable amount 
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of data that are reported in descriptive accounts only, 
limiting both their pooled validity and the interpretation 
of their results. Additionally, many studies failed to re- 
port some important statistical information (e.g. sample 
sizes, mean scores, SDs) required to calculate trial esti- 
mates and to integrate them in a meta-analysis. 

No study fulfilled the set of methodological quality cri- 
teria assessed, despite widely available guidelines for RCTs. 
Trials of lower methodological quality (small study, at 
least 20% attrition and lack/unclear allocation conceal- 
ment) tended to inflate the results and only less than 50% 
of the trials maintained the least target sample required 
to achieve power, which makes results less trustworthy. 
The most probable small study bias affecting the effect 
sizes are the results of small negative studies which are 
generally less likely to be published than small studies with 
positive results (i.e. publication bias). Blinding (clinicians, 
patients and outcome assessors) was reported in only a 
few trials and we don't rule out the possibility that patient 
satisfaction, a subjective outcome, may have been espe- 
cially positively affected by this. The trials consisted of 
follow-up episodes of variable length (0.5 to 122.4 
months) which may have limited the true effect of care es- 
pecially in multi-morbid or serious illnesses. Our analyses 
partly explained the reasons for heterogeneity where this 
was present but several other variables, which we could 
not account for, may have also caused this. Patients' per- 
ception and evaluation of satisfaction may be inherently 
subjective due to socio-demographic differences, experi- 
ences from previous care, the physical environment, and 
patient-care provider interactions. Therefore, measure- 
ments of outcome using validated tools are preferred. Of 
the trials appraised, less than 50% used validated question- 
naires for patient satisfaction. 

We also identified a lack of trials of cluster randomisa- 
tion. Although these may be more complex in design, if 
accounted for all key factors including clustering effect, 
appropriate sampling and analyses, cluster RCTs could 
add important value to the current evidence. 

Surprisingly, there is a dearth of economic data. The 
little evidence available on the cost of physician-nurse 
substitution relies on results which are mainly based on 
direct costs and use variable approaches. The more re- 
cent literature reports more economic data, but it seems 
difficult to integrate these results especially because cost 
evaluations differ across countries and thus in cost mea- 
surements. We found only two publications [31,38] pro- 
viding economic data related to three of the included 
trials. Despite continued claims of substituting physi- 
cians by nurses based on healthcare costs, the evidence 
can only suggest that substitution is cost neutral. There- 
fore, as suggested in a recent systematic review of eco- 
nomic evaluations [49], to meaningfully place the costs 
and consequences of substitution in the context of 



healthcare, studies should address all types of costs. 
Relevant and appropriate data should be generated by 
means of a systematic collection of economic measures, 
and specific rules for cost data estimations should also 
be defined and followed. 

More intensive implementation could enhance the 
outcomes of nursing care, but most studies do not pro- 
vide the necessary information. In the evaluated studies, 
the assumption is that nurses possess the competence 
required for substituting physicians, but the level of sub- 
stitution does not seem equal among studies. While the 
level of training may be a critical factor for an effective 
outcome, the studies report incomplete descriptions of 
nurses' roles and competencies. The level of clinical au- 
tonomy in nurses does not seem consistent with the 
level of training and the tasks performed. Also, nurses 
still require support or communication with the phys- 
ician for various tasks. It seems then that the level of 
qualification and training required to carry out substitu- 
tion requires yet a better definition of practice boundar- 
ies including a classification of tasks. Better criteria 
conceptualised to define nurses' roles and responsibil- 
ities are needed. In addition, the various differences be- 
tween countries' definitions and their organisation of 
nurse care should be taken into account. Lastly, more 
than half of the evidence reviewed (62.5%) has been con- 
ducted in Europe, mainly the UK and the Netherlands. 

It is apparent that there is much room for primary 
studies that include larger numbers of patients, meth- 
odologically more rigorous in terms of quality, compre- 
hensive in terms of data and statistical methods and 
with longer follow-up episodes. Furthermore, in order to 
gain a better understanding of substitution, future re- 
search should map a wider range of nurses, the various 
levels of training and clinicians' characteristics, which 
are provided in many countries. As suggested previously 
[50], each method of skill- mix may have its own 
strengths and weaknesses. The implementation of meth- 
odologies aiming at the standardisation of skill-mix stud- 
ies could support a sound assessment such that health 
sector reform may also benefit from the publication of 
evidence. 

Strengths and limitations of the review 

Our review updates and extends earlier systematic re- 
views [3,4] and benefits from a thorough assessment of 
RCTs, in which the nurse acted as the main figure of 
care. It also presents (where available) the results by 
nurses' roles. Having used the fixed effect model, we can 
only make inferences about the studies included in the 
meta-analyses performed here. We only included RCTs 
because these are at a lower risk of bias and allow for 
the identification of causal relationships. Although non- 
randomised trials may overestimate the benefits of 
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nurse-led care it would be recommended to scrutinise 
the current evidence with such designs. These may not 
only provide an opportunity for an update but also allow 
for the collection of data from long term (more than 12 
months) follow-up designs which may consist of larger 
sample sizes. We only included publications in English. 
We did however screen the reference lists of relevant re- 
views (some in foreign languages) and searched the ref- 
erence lists of all included studies. We did not contact 
authors for further information nor did we search for 
grey literature. A further limitation is that it was often 
difficult to understand in detail what role and responsi- 
bilities nurses had, when substituting physicians. In 
many cases, they remain embedded in patient care teams 
that also involved physicians. 

Conclusion 

The slowly growing number of studies, assessing substitu- 
tion of physicians by nurses is still substantially limited by 
methodological deficiencies. Also, the current evidence 
belongs to a small selection of healthcare systems lacking 
good quality data. Nevertheless, nurse-led care seems to 
have a positive effect on hospital admissions and mortal- 
ity. This important finding should be confirmed and the 
determinants of this effect should be assessed in future 
studies. Before implementing new changes in the delivery 
of healthcare, further, larger and more methodically rigor- 
ous primary research should address the quality of 
the data on both health outcomes and costs. Primary 
research should also differentiate between types of nurses, 
qualifications and tasks. In particular, we recommend con- 
sidering the role of multidisciplinary teams in which 
nurses are embedded, also when substituting physicians in 
specific clinical tasks. 
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